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ABSTRACT. The folding of the a-helice domain hbSBD of the mammalian mito- 
chondrial branched- chain a-ketoacid dehydrogenase (BCKD) complex is studied 
by the circular dichroism technique in absence of urea. Thermal denaturation 
is used to evaluate various thermodynamic parameters defining the equilibrium 
unfolding, which is well described by the two-state model with the folding tem- 
perature Tp = 317.8±1.95 K and the enthalpy change AHq = 19.67±2.67 kcal/mol. 
The folding is also studied numerically using the off-lattice coarse-grained Go 
model and the Langevin dynamics. The obtained results, including the popula- 
tion of the native basin, the free energy landscape as a function of the number 
of native contacts and the folding kinetics, also suggest that the hbSBD domain 
is a two-state folder. These results are consistent with the biological function of 
hbSBD in BCKD. 
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Introduction 

Understanding the dynamics and mechanism of protein folding remains one of the most 
challenging problems in molecular biology [1]. Single domain a proteins attract much at- 
tention of researchers because most of them fold faster than (3 and a(3 proteins 0, [J due to 
relatively simple energy landscapes and one can, therefore, use them to probe main aspects 
of the funnel theory [4J. Recently, the study of this class of proteins becomes even more 
attractive because the one-state or downhill folding may occur in some small a proteins 

The mammalian mitochondrial branched- chain a-ketoacid dehydrogenase (BCKD) com- 
plex catalyzes the oxidative decarboxylation of branched-chain a-ketoacids derived from 
leucine, isoleucine and valine to give rise to branched-chain acyl-CoAs. In patients with 
inherited maple syrup urine disease, the activity of the BCKD complex is deficient, which is 
manifested by often fatal acidosis and mental retardation 8j. The BCKD multi-enzyme com- 
plex (4,000 KDa in size) is organized about a cubic 24-mer core of dihydrolipoyl transacylase 
(E2), with multiple copies of hetero-tetrameric decarboxylase (El), a homodimeric dihydro- 
genase (E3), a kinase (BCK) and a phosphatase attached through ionic interactions. The E2 
chain of the human BCKD complex, similar to other related multi-functional enzymes Q], 
consists of three domains: The amino-terminal lipoyl-bearing domain (hbLBD, 1-84), the 
interim E1/E3 subunit-binding domain (hbSBD, 104-152) and the carboxy-terminal inner- 
core domain. The structures of these domains serve as bases for modeling interactions of the 
E2 component with other components of a-ketoacid dehydrogenase complexes. The struc- 
ture of hbSBD (Fig. 1) has been determined by NMR spectroscopy, and the main function 
of the hbSBD is to attach both El and E3 to the E2 core lOj. The two-helix structure of 
this domain is reminiscent of the small protein BBL 1711 which may be a good candidate for 
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observation of downhill folding [5J, [6(. So the study of hbSBD is interesting not only because 
of the important biological role of the BCKD complex in human metabolism but also for 
illuminating folding mechanisms. 

From the biological point of view, hbSBD could be less stable than hbLBD and one of 
our goals is, therefore, to check this by the circular dichroism (CD) experiments. In this 
paper we study the thermal folding-unfolding transition in the hbSBD by the CD technique 
in the absence of urea and pH=7.5. Our thermodynamic data do not show evidence for 



2 



the downhill folding and they are well fitted by the two-state model. We obtained folding 
temperature Tp = 317.8 ±1.95 K and the transition enthalpy AHq = 19.67±2.67 kcal/mol. 
Comparison of such thermodynamic parameters of hbSBD with those for hbLBD shows that 
hbSBD is indeed less stable as required by its biological function. However, the value of AHq 
for hbSBD is still higher than those of two-state a proteins reported in [3j , which indicates 
that the folding process in the hbSBD domain is highly cooperative. 

From the theoretical point of view it is very interesting to establish if the two-state 
foldability of hbSBD can be captured by some model. The all-atom model would be the best 
choice for a detailed description of the system but the study of hbSBD requires very expensive 



CPU simulations. Therefore we employed the off-lattice coarse-grained Go-like model 
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which is simple and allows for a thorough characterization of folding properties. In this model 
amino acids are represented by point particles or beads located at positions of C„ atoms. 
The Go model is defined through the experimentally determined native structure |10|, and 



it captures essential aspects of the important role played by the native structure [12, 

It should be noted that the Go model by itself can not be employed to ascertain the 
two-state behavior of proteins. However, one can use it in conjunction with experiments 
providing the two-state folding because this model does not always provide the two-state 
behavior as have been clearly shown in the seminal work of Clementi et al. 12j. In fact, the 



Go model correctly captures not only the two-state folding of proteins CI2 and SH3 (more 

two-state Go folders may be found in Ref. llJl) but also intermediates of the three-state 

□ 

folder barnase, RNAse H and CheY [12j|. The reason for this is that the simple Go model 
ignores the energetic frustration but it still takes the topological frustration into account. 
Therefore, it can capture intermediates that occur due to topological constraints but not 
those emerging from the frustration of the contact interactions. 

With the help of Langevin dynamics simulations and the histogram method 15| we have 
shown that, in agreement with our CD data, hbSBD is a two-state folder with a well-defined 
transition state (TS) in the free energy landscape. The two helix regions were found to 
be highly structured in the TS. The two-state behavior of hbSBD is also supported by 
our kinetics study showing that the folding kinetics follows the single exponential scenario. 
The two-state folding obtained in our simulations suggests that for hbSBD the topological 
frustration is more important than the energetic factor. 

The dimensionless quantity, Q c [jij, which characterizes the structural cooperativity of 
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the thermal denaturation transition was computed and the reasonable agreement between 
the CD experiments and Go simulations was obtained. Incorporation of side chains may 
give a better agreement [16|, ll7| but this problem is beyond the scope of the present paper. 



Materials and Methods 

Sample Preparation 

hbSBD protein was purified from the BL21(DE3) strain of E. coli containing a plasmid that 
carried the gene of hbLBD(l-84), a TEV cleavage site in the linker region, and hbSBD (104- 
152), generously provided to us by Dr. D.T. Chuang of the Southwestern Medical Center, 
University of Texas. There is an extra glycine in front of Glul04 which is left over after TEV 
cleavage, and extra leucine, glutamic acid at the C-terminus before six histidine residues. 
The protein was purified by Ni-NTA affinity chromatography, and the purity of the protein 
was found to be better than 95%, based on the Coomassie blue-stained gel. The complete 
sequence of N = 52 residues for hbSBD is 

(G)EIKGRKTLATPAVRRLAMENNIKLSEVVGSGKDGRILKEDILNYLEKQT(L)(E). 
Circular Dichroism 

CD measurements were carried out in Aviv CD spectrometer model 202 with temperature 
and stir control units at different temperature taken from 260nm to 195nm. All experi- 
ments were carried at 1 nm bandwidth in 1.0 cm quartz square cuvette thermostated to 
±0.1°C. Protein concentration (~ 50 uM) was determined by UV absorbance at 280nm us- 
ing e280nm=1280 M _1 cm _1 with 50mM phosphate buffer at pH7.5. Temperature control was 
achieved using a circulating water bath system, and the equilibrium time was three minutes 
for each temperature point. The data was collected at each 2K increment in temperature. 
The study was performed at heating rate of 10°C/min and equilibration time of 3 minutes. 
The volume changes as a result of thermal expansion as well as evaporation of water were 
neglected. 

Fitting Procedure 

Suppose the thermal denaturation is a two-state transition, we can write the ellipticity as 

= 9n + (9 N -9 D )f N , (1) 
where 9jj and 9n are values for the denaturated and folded states. The fraction of the folded 
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conformation is expressed as 

fN = 



1 



1 + exp(-AG r /T) ' 
AG T = AH T -TAS T = AH G 
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Here AHq and AC P are jumps of the enthalpy and heat capacity at the mid-point tem- 
perature Tg (also known as melting or folding temperature) of thermal transition, respec- 
tively. Some other thermodynamic characterization of stability such as the temperature of 
maximum stability (Ts), the temperature with zero enthalpy (Th), and the conformational 
stability (AGs) at T s can be computed from results of regression analysis [3] 
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Using Eq. (CQ) - Eq. (jSJ) we can obtain all thermodynamic parameters from CD data. 

It should be noted that the fitting of Eq. [2] with AC P > allows for an additional cold 



denaturation 



20j at temperatures much lower than the room temperature . The temperature 



of such a transition, T' G , may be obtained by the same fitting procedure with an additional 
constraint of AHg < 0. Since the cold denaturation transition is not seen in Go models, to 
compare the simulation results to the experimental ones we also use the approximation in 



which AC P = 0. 



Simulation 



We use coarse-grained continuum representation for hbSBD protein, in which only the po- 
sitions of 52 C a -carbons are retained. We adopt the off-lattice version of the Go model [11] 
where the interaction between residues forming native contacts is assumed to be attractive 
and the non-native interactions - repulsive. Thus, by definition for the Go model the PDB 
structure is the native structure with the lowest energy. The advantage of this model is its 



simplicity which allows one to study model proteins in detail. Following Ref. 



12 



, we write 
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the energy of the Go-like model as 



E = K Ma+i ~ r 0i ,i + i) 2 + K ^ 0i ~ d < 

bonds angles 

+ {< ) [l-cos(A0 l )]+4 3) [l-cos3(A0,)]} 

dihedral 

NC NNC , ^ x 12 

+ 5>["$-«*ifl + £*(^J • («) 



i>j—3 i>j~ 3 



Here A0j = (pi — <p 0i , Rij = r 0i j/rij; r iji+ i, 6>j and 0j stand for the zth bond length between 
the zth and (z + l)th residues, the bond angle between the (i — l)th and zth bonds and 
the dihedral angle around the zth bond, respectively, is the distance between the ith 
and jth residues. Subscript "0" , "NC" and "NNC" refer to the native conformation, native 
contacts and non-native contacts, respectively. The first harmonic term keeps the chain 
connectivity, while the second and third terms represent the local angular interactions. Two 
last terms are non-local interactions, where the former includes native contact interactions 
and the latter is nonspecific repulsion between non-native pairs. We choose K r = 100e#, 
Kq = 20e h,K^ = £h,Kj/ = 0.56 ff, e i = = £h and C = 4 A, where en is the 



characteristic hydrogen bond energy 
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The nativeness of any configuration is measured by the number of native contacts Q. 
We define that the zth and jth residues are in the native contact if roy is smaller than a 
cutoff distance d c taken to be d c = 7.5 A, where r 0i j is the distance between the zth and 
jth residues in the native conformation. Using this definition and the native conformation 



of Ref. 



10 



we found that the total number of native contacts Q total is 62. To study the 



probability of being in the native state we use the following overlap function 21] 



1 N 

X = ( L2r «i - '<^ A v (7) 

^^ total ■ . ■ , , 

where Ay is equal to 1 if residues z and j form a native contact and otherwise and 9{x) 
is the Heaviside function. The argument of this function guarantees that a native contact 
between z and j is classified as formed when ry is shorter than 1.2roy 121 ] 

The overlap function x, which is one if the conformation of the polypeptide chain coin- 
cides with the native structure and zero for unfolded conformations, can serve as an order 
parameter for the folding-unfolding transition. The probability of being in the native state, 
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/jv, which can be measured by the CD and other experimental techniques, is defined as 
fN=<X>i where < ... > stands for a thermal average. 

The dynamics of the system is obtained by integrating the following Langevin equation 
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d 2 r .dr ,„ s 

where m is the mass of a bead, ( is the friction coefficient, F c = dE/df. The random force T 
is a white noise, i.e. < Ti(t)Tj(t f ) >= 2(k B T5ij5(t — t'), where % and j refer to components 
x, y and z. It should be noted that the folding thermodynamics does not depend on the 
enviroment viscosity (or on () but the folding kinetics depends on it [23]. We chose the 
dimensionless parameter £ = (^^Y^C = 8, where m is the mass of a bead and a is the 
bond length between successive beads. One can show that this value of ( belongs to the 
interval of the viscosity where the folding kinetics is fast. We have tried other values of ( 
but the results remain unchanged qualitatively. 

We measure time in units of tl = {ma 2 / tn) 1 ^ 2 ■ Using the typical value m = 3.10 -25 



kg 24], a = 4A and tu = 0.91 kcal/mol (this choice of en follows from the requirement 



that the simulated folding temperature coincides with the experimental one, see below) we 



obtained tl m 3 ps. The dynamics Eq. (JSJ) was solved by the Verlet algorithm [25] with 
the time step At = 0.005r^. All thermodynamic quantities are obtained by the histogram 
method [151 ] . 



Results 

CD Experiments 

The structure of hbSBD is shown in Figure 1. Its conformational stability is investigated in 
present study by analyzing the unfolding transition induced by temperature as monitored by 



CD, similar to that described previously 26|, |27|]. The reversibility of thermal denaturation 



was ascertained by monitoring the return of the CD signal upon cooling from 95°C to 22 °C; 
immediately after the conclusion of the thermal transition. The transition was found to be 
more than 80% reversible. Loss in reversibility to greater extent was observed on prolonged 
exposure of the sample to higher temperatures. This loss of reversibility is presumably due 
to irreversible aggregation or decomposition. Figure 2 shows the wavelength dependence of 
mean residue molar ellipticity of hbSBD at various temperatures between 278K and 363K. 
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In a separate study, the thermal unfolding transition as monitored by ellipticity at 228 nm 
was found to be independent of hbSBD concentration in the range of 2 uM to 10 uM. It 
was also found to be unaffected by change in heating rate between 2°C/min to 20°C/min. 
These observations suggest absence of stable intermediates in heat induced denaturation 
of hbSBD. A valley at around 220 nm, characteristics of the helical secondary structure is 
evident for hbSBD. 

Figure 3 shows the temperature dependence of the population of the native conformation, 
/at, for wave lengths A = 208, 212 and 222 nm. We first try to fit these data to Eq. (J2]) 
with AC P 7^ 0. The fitting procedure gives slightly different values for the folding (or 
melting) temperature and the enthalpy jump for three sets of parameters. Averaging over 
three values, we obtain Tq = 317.8 ± 1.95 K and AHq = 19.67 ± 2.67 kcal/mol. Other 
thermodynamic quantities are shown on the first row of Table 1. The similar fit but with 
AC P = gives the thermodynamic parameters shown on the second row of this table. Since 
the experimental data are nicely fitted to the two-state model we expect that the downhill 
scenario does not applied to the hbSBD domain. 

For the experimentally studied temperature interval two types of the two-state fit (J2]) 
with AC P = and AC P ^ give almost the same values for T G , AH G and ASq- However, 
pronounced different behaviors of the population of the native basin, f^, occur when we 
interpolate results to the low temperature region (Fig. 4). For the AC P = case, /jv 
approaches the unity as T — > but it goes down for AC P ^ 0. This means that the AC P ^ 
fit is valid if the second cold denaturation transition may occur at Tq- This phenomenon 
was observed in single domains as well as in multi-domain globular proteins 20J . We predict 
that the cold denaturation of hbSBD may take place at T' G ~ 212 K which is lower than 
T' G ?a 249.8 K for hbLBD shown on the 4th row of Table 1. It would be of great interest to 
carry out the cold denaturation experiments in cryo-solvent to elucidate this issue. 

To compare the stability of the hbSBD domain with the hbLBD domain which has been 
studied in detail previously we also present the thermodynamic data of the latter on 
Table 1. Clearly, hbSBD is less stable than hbLBD by its smaller AGs and lower Tq values. 
This is consistent with their respective backbone dynamics as revealed by 15 N-T 1; 15 N-T 2 , 
and 15 N- 1 H NOE studies of these two domains using uniformly 15 N-labeled protein samples 
(Chang and Huang, unpublished results). Biologically, hbSBD must bind to either El or 
E3 at different stages of the catalytic cycle, thus it needs to be flexible to adapt to local 
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enviroments of the active sites of El and E3. On the other hand, the function of hbLBD is 
to permit its Lys44 residue to channel acetyl group between donor and acceptor molecules 
and only the Lys44 residue needs to be flexible 28j|. In addition, the NMR observation for 
the longer fragment (comprising residues 1-168 of the E2 component) also showed that the 
hbLBD region would remain structured after several months while the hbSBD domain could 
de-grate in a shorter time. 

Folding Thermodynamics from Simulations 

In order to calculate the thermodynamics quantities we have collected histograms for the 
energy and native contacts at six values of temperature: T = 0.4,0.5,0.6,0.7,0.8 and 1.0 
e_ff//c_B- For sampling, at each temperature 30 trajectories of 16 x 10 7 time steps have been 
generated with initial 4 x 10 7 steps discarded for thermalization. The reweighting histogram 



method 



15| was used to obtain the thermodynamics parameters at all temperatures. 



Figure 4 (open circles) shows the temperature dependence of population of the native 
state, defined as the renormalized number of native contacts (see Material and Methods) for 
the Go model. Since there is no cold denaturation for this model, to obtain the thermody- 
namic parameters we fit fjq to the two-state model (Eq. [2]) with AC P = 0. 

The fit (black curve) works pretty well around the transition temperature but it gets worse 
at high T due to slow decay of J'n which is characteristic for almost all of theoretical models. 
In fitting we have chosen the hydrogen bond energy e# = 0.91 kcal/mol in Hamiltonian (J6]) 
so that Tq = 0.7en/kB coincides with the experimental value 317.8 K. From the fit we obtain 
AHg = 11.46 kcal/mol which is smaller than the experimental value indicating that the Go 
model is less stable compared to the real hbSBD. 

Figure 5 shows the temperature dependence of derivative of the fraction of native contacts 
with respect to temperature df^jdT (we also call this value the structural susceptibility) and 
the specific heat C v obtained from the Go simulations. The collapse temperature Tq, defined 
as the temperature at which C v is maximal, almost coincides with the folding temperature Tp 
at Tp the structural susceptibility has maximum). According to Klimov and Thirumalai 



29], the dimensionless parameter a = ^ r TF may serve as an indicator for foldablity of 



proteins. Namely, sequences with a < 0.1 fold much faster that those which have the same 
number of residues but with a exceeding 0.5. From this perspective, having a m hbSBD is 
supposed to be a good folder in silico. However, one has to be cautious about this conclusion 
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because the pronounced correlation between foldin g ti mes Tp and the equilibrium parameter 



a, observed for simple on- and off-lattice models 



24 



29] may be not valid for proteins in 



laboratory [30|]. In our opinion, since the data collected from theoretical and experimental 
studies are limited, further studies are required to clarify the relationship between Tp and 
a. 

Using experimental values for Tq (as Tp) and AHq and the two-state model with AC P = 
(see Table 1) we can obtain the temperature dependence of the population of native state 
/at and, therefore, df^/dT for hbSBD (Fig. 5). Clearly, the folding- unfolding transition in 
vitro is sharper than in the Go modeling. One of possible reasons is that our Go model 
ignores the side chain which can enhance the cooperativity of the denaturation transition 



16] . 



The sharpness of the fold-unfolded transition might be characterized quantitatively via 



the cooperativity index fi c which is defined as follows 31] 

_F 

AT \ dT y , ; 

where AT is the transition width. From Fig. 5, we obtain Q c = 51.6 and 71.3 for the Go 
model and CD experiments, respectively. Given the simplicity of the Go model used here the 
agreement in Q c should be considered reasonable. We can also estimate Q c from the scaling 



law suggested in Ref. ]32|, fi c = 0.0057 x iV M , where exponent // is universal and expressed 
via the random walk susceptibility exponent 7as/i=l+7~ 2.22(7 ~ 1.22). Then we get 
Q c ~ 36.7 which is lower than the experimental as well as simulation result. This means 
that hbSBD in vitro is, on average, more cooperative than other two-state folders. 



Another measure for the cooperativity is k 2 which is defined as 33] k 2 = AH v h/AH ca [, 



where AH vh = 2T max a/ k B C v (T max ) and AH cai = J °° C v (T)dT, are the van't Hoff 
and the calorimetric enthalpy, respectively, Cy{T) is the specific heat. Without the baseline 



substraction in Cy(T) [34j, for the Go model of hbSBD we obtained K2 ~ 0.25. Applying the 
baseline substraction as shown in the lower part of Fig. 5 we got K2 ~ 0.5 which is still much 
lower than k 2 ~ 1 for a trully all-or-none transition. Since n 2 is an extensive parameter, its 
low value is due to the shortcomings of the off-lattice Go models but not due to the finite 
size effects. More rigid lattice models give better results for the calorimetric cooperativity 



171 ] . Thus, for the hbSBD domain the Go model gives the better agreement with our CD 



experiments for the structural cooperativity fl c than for the calorimetric measure k 2 . 
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Free Energy Profile 

To get more evidence that hbSBD is a two-state folder we study the free energy profile 
using some quantity as a reaction coordinate. The precise reaction coordinate for a multi- 
dimensional process such as protein folding is difficult to ascertain. However, Onuchic and 
coworkers 35[ have argued that, for minimally frustrated systems such as Go models, the 
number of native contact Q may be appropriate. Fig. 6(a) shows the dependence of free 
energy on Q for T = Tp. Since there is only one local maximum corresponding to the 
transition state (TS), hbSBD is a two-state folder. This is not unexpectable for hbSBD 
which contains only helices. The fact that the simple Go model correctly captures the two- 
state behavior as was observed in the CD experiments, suggests that the energetic frustration 
ignored in this model plays a minor role compared to the topological frustration 12] . 

We have sorted out structures of the denaturated state (DS), TS and the folded state (FS) 
at T = Tp generating 10 4 conformations in equilibrium. The distributions of the RMSD, 
-Prmsd, of these states are plotted in Fig. 6(b). As expected, Prmsd for the DS spreads out 
more than that for the TS and FS. According to the free energy profile in Fig. 6(a), the 
TS conformations have 26 - 40 native contacts. We have found that the size (number of 
folded residues) [36] of the TS is equal to 32. Comparing this size with the total number 
of residues (N = 52) we see that the fraction of folded residues in the TS is higher than 
;he typical value for real two-state proteins 36(. This is probably an artefact of Go models 
37| |. The TS conformations are relatively compact having the ratio < R^ s > /R^ s ~ 1.14, 
where < R^ s > is the average radius of gyration of the TS ensemble and Rg is the radius 
of gyration of the native conformation shown in Fig. 1. Since the RMSD, calculated only 
for two helices, is about 0.8 Athe structures of two helices in the TS are not distorted much. 
It is also evident from the typical structure of the TS shown in Fig. 6(b) where the helix 
regions Hx and H 2 involve residues 13 - 19 and 39 - 48, respectively (a residue is considered 
to be in the helix state if its dihedral angle is about 60°). Note that H4 has two residues 
less compared to Hx in the native conformation (see the caption to Fig. 1) but H 2 has even 
one bead more than its native state counterpart. Overall, the averaged RMSD of the TS 
conformations from the native conformation (Fig. 1) is about 4.9 A indicating that the TS 
is not close to the native one. As seen from Figs. 6(a) and 1, the main difference comes from 
the tail parts. The most probable conformations (corresponding to maximum of Prmsd in 
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Fig. 6(b) of the FS have RMSD about 2.5 A. This value is reasonable from the point of 
view of the experimental structure resolution. 

Folding Kinetics 

The two-state foldability, obtained from the thermodynamics simulations may be also probed 
by studying the folding kinetics. For this purpose we monitored the time dependence of the 
fraction of unfolded trajectories P u (t) defined as follows 38] 

P u (t) = 1 - / P£ {s)ds, (10) 
Jo 

where Pfl is the distribution of first passage folding times 



fp 

M 

fp ~ M 



1=1 

Here Tf t u is time for the ith trajectory to reach the native state for the first time, M is 
the total number of trajectories used in simulations. A trajectory is said to be folded if all 
of native contacts form. As seen from Eqs [TD] and [TTJ, P u (t) is the fraction of trajectories 
which do not reach the native state at time t. In the two-state scenario the folding becomes 
triggered after overcoming only one free energy barrier between the transition state and the 
denaturated one. Therefore, P u (t) should be a single exponential, i.e. P u {t) ~ exp(— t/rp) 
(a multi-exponential behavior occurs in the case when the folding proceeds via intermedi- 
ates) 38j. Since the function P u (t) can be measured directly by a number of experimental 



techniques [39|, |40j, the single exponential kinetics of two-state folders is supported by a 
large body of experimental work (see, i.e. Ref. 26j and references there). 

Fig. 7 shows the semi- logarithmic plot for P u (t) at T = Tp for the Go model. Since 
the single exponential fit works pretty well, one can expect that intermediates do not occur 
on the folding pathways. Thus, together with the thermodynamics data our kinetic study 
supports the two-state behavior of the hbSBD domain as observed on the CD experiments. 

From the linear fit in Fig. 7 we obtain the folding time Tp « 0.1/xs. This value is 
consistent with the estimate of the folding time defined as the average value of the first 
passage times. If we use the empirical formula for the folding time Tp = r F exp(l.lA r1 / 2 ), 
where pref actor r F = 0.4/xs and N is a number of amino acids 3JJ then Tp = 1.1 x 10 3 /is 
for N = 52. This value is about four orders of magnitude larger than that obtained from 
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the Go model. Thus the Go model can capture the two-state feature of the denaturation 
transition for hbSBD domain but not folding times. 



Discusion 

We have used CD technique and the Langevin dynamics to study the mechanism of folding 
of hbSBD. Our results suggest that this domain is a two-state folder. The CD experiments 
reveal that the hbSBD domain is less stable than the hbLBD domain in the same BCKD 
complex, but it is more stable and cooperative compared to other fast folding a proteins. 

Both the thermodynamics and kinetics results, obtained from the Langevin dynamics 
simulations, show that the simple Go model correctly captures the two-state feature of 
folding. It should be noted that the two-state behavior is not the natural consequence of the 
Go modeling because it allows for fishing folding intermediates caused by the topological 
frustration. From this standpoint it may be used to decipher the foldability of model proteins 
for which the topological frustration dominates. The reasonable agreement between the 
results obtained by the Go modeling and our CD experiments, suggests that the native 
state topology of hbSBD is more important than the energetic factor. 

The theoretical model gives the reasonable agreement with the CD experimental data for 
the structural cooperativity Q c . However, the calorimetric cooperativity criterion k 2 ~ 1 for 
two-state folders is hard to fulfill within the Go model. From the AC P 7^ fitting procedure 
we predict that the cold denaturation of hbSBD may occur at T m 212 K and it would be 
very interesting to verify this prediction experimentally. We are using the package SMMP 
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42| | and a parallel algorithm 43j| to perform all-atom simulation of hbSBD to check the 



relevant results. 
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Domain 


T G (K) 


AH G 
(kcal/mol/K) 


AC P 
(kcal/mol/K) 


AS a 
(cal/mol/K) 


T S (K) 


T H (K) 


AGs 
(kcal/mol) 


T Q {K) 


SBD(exp) 


317.8 ±1.9 


19.67 ±2.67 


0.387 ±0.054 


61.64 ±7.36 


270.9 ± 2.0 


267.0 ±2.1 


1.4 ±0.1 


212 ±2.5 


SBD(exp) 


317.9 ±2.2 


20.02 ±3.11 


0.0 


62.96 ±9.92 










SBD(sim) 


317.9 ± 7.95 


11.46 ±0.29 


0.0 


36.05 ± 1.85 










LBD(exp) 


344.0 ± 0.2 


78.96 ±1.28 


1.51 ±0.04 


229.5 ±3.7 


295.7 ±3.7 


291.9 ± 1.3 


5.7 ±0.2 


249.8 ± 1.1 



TABLE 1. Thermodynamic parameters obtained from the CD experiments and simulations 
for hbSBD domain. The results shown on the first and fourth rows were obtained by fitting 
experimental data to the two-state equation <^ with AC P ^ 0. The second and third rows 



corresponding to the fit with AC P = 0. The results for hbLBD are taken from Ref. 
comparison. 



27 for 
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Figure Captions 



FIGURE 1. Ribbon representation of the structure of hbSBD domain. The helix region 
Hi and H 2 include residues Prol2 - Glu20 and Lys39 - Glu47, respectively. 

FIGURE 2. Dependence of the mean residue molar ellipticity on the wave length for 18 
values of temperatures between 278 and 363 K. 

FIGURE 3. Temperature dependence of the fraction of folded conformations /jv, obtained 
from the ellipticity 9 by Eq. (J2J), for wave lengths A = 208 (blue circles), 212 (red squares) 
and 222 nm (green diamonds). The solid lines corresponds to the two state fit given by Eq. 
[2] with AC P ^ 0. We obtained T G = T F = 317.8 ± 1.9 K, AH G = 19.67 ± 2.67 kcal/mol and 
AC P = 0.387 ±0.054. 

FIGURE 4. The dependence of Jn for various sets of parameters. The blue and red curves 
correspond to the thermodynamic parameters presented on the first and the second rows 
of Table 1, respectively. Open circles refer to simulation results for the Go model. The 
solid black curve is the two-state fit (AC P = 0) which gives AH G = 11.46 kcal/mol and 
Tp = 317.9. 

FIGURE 5. The upper part refers to the temperature dependence of df^/dT obtained by 
the simulations (red) and the CD experiments (blue). The experimental curve is plotted 
using two-state parameters with AC P = (see, the second row on Table 1). The temperature 
dependence of the heat capacity Cy{T) is presented in the lower part. The dotted lines 
illustrate the base line substraction. The results are averaged over 20 samples. 

FIGURE 6. (a) The dependence of free energy on the number of native contacts Q at 
T = Tp. The typical structures of the denaturated state , transition state and folded state 
are also drawn. The helix regions Hi (green) and H2 (orange) of the TS structure involve 
residues 13 - 19 and 39 - 48, respectively. For the FS structure H 2 is the same as for the 
TS structure but Hi has two residues more (13 - 21). (b) Distributions of RMSD for three 
ensembles shown in (a). The average values of RMSD are equal to 9.8, 4.9 and 3.2 A for 
the DS, TS and FS, respectively. 
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FIGURE 7. The semi-logarithmic plot of the time dependence of the fraction of unfolded 
trajectories at T = T F . The distribution P u (t) was obtained from first passage times of 400 
trajectories, which start from random conformations. The straight line corresponds to the 
fit In P u (t) = —t/r F , where t f = 0.1/xs. 
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